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CORRELATION OF SEQUENCES AND OF MEASURES, 
GENERIC POINTS FOR JOININGS AND ERGODICITY OF 
GERTAIN GOCYGLES 


JEAN-PIERRE CONZE, TOMASZ DOWNAROWICZ AND JACEK SERAFIN 

Abstract. The main subject of the paper, motivated by a question raised 
by Boshernitzan, is to give criteria for a bounded complex-valued sequence 
to be uncorrelated to any strictly ergodic sequence. As a tool developed to 
study this problem we introduce the notion of correlation between two shift- 
invariant measures supported by the symbolic space with complex symbols. 
We also prove a “lifting lemma” for generic points: given a joining ^ of two 
shift-invariant measures /i and every point x generic for fi lifts to a pair 
(x^y) generic for ^ (such y exists in the full symbolic space). This lemma al¬ 
lows us to translate correlation between bounded sequences to the language of 
correlation of measures. Finally, to establish that the property of an invariant 
measure being uncorrelated to any ergodic measure is essentially weaker than 
the property of being disjoint from any ergodic measure, we develop and ap¬ 
ply criteria for ergodicity of four-jump cocycles over irrational rotations. We 
believe that apart from the applications to studying the notion of correlation, 
the two developed tools: the lifting lemma and the criteria for ergodicity of 
four-jump cocycles, are of independent interest. This is why we announce them 
also in the title. In the Appendix we also introduce the notion of conditional 
disjointness. 


The research of the second and third author is supported by the NCN (National 
Science Center, Poland) grant 2013/08/A/ST1/00275. 


1. Basic notions and motivation 


Definition 1 . 1 . For finite sequences (blocks ) A = [ai, 02,..., a„], B = [bi,b2, ■ ■ ■ ,bn] 
(of the same length n), consisting of complex numbers, we define 



Two complex-valued bounded infinite sequences x, y are declared uncorrelated, weakly 
correlated, or strongly correlated, if 


limcorr(a;[i,„],y[i,„]) = 0, 

linisup|corr(x[i,„],y[i,„])| > 0, 


n 


liminf |corr(x[i,„],i/[i,„])| > 0, 


respectively. (We are using the following notation: is the block [xi, X2, ■ • ■, Xn]-) 

Notice that if at least one sequence has zero mean, then the correlation uses only 
the first average (of the products). 
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Although most of our results apply to sequences in the complex space , for 
simplicity of the forthcoming arguments we will restrict our attention to the space 
of sequences taking values in a finite subset K of the unit disc. This will allow 
us to use a range of measure-theoretic and topological tools applicable for subshifts. 
We will call K the alphabet, and N denotes the set of positive integers (so that the 
enumeration of our sequences always starts with the index 1). 

The famous Sarnak Conjecture (see [Sarj l asserts that the Mobius function (jl is 
uncorrelated to any deterministic sequence x where fi is the “signed characteristic 
function” of square-free numbers: 


/x(n) = < 


1 if n = 1, 

0 if n has a repeated prime factor, 

(—I)” if n is a product of r distinct primes, 


and a deterministic sequence is any sequence x of the form 


Xr, = f{T^a), 


where / is a continuous complex-valued function defined on a topological dynamical 
system {X, T) (i.e., T : X —^ X is a continuous transformation of a compact metric 
space X) of topological entropy zero and a € X (equivalently, the shift orbit-closure 
of X has topological entropy zero). Observe that the set of square-free numbers has 
positive density in N, so the conjecture is not trivial. An up-to-date exposition 
of classes of systems for which the conjecture holds, as well as some new results, 
can be found in [AKLR] . While Sarnak Conjecture is still far from being solved, 
our research is motivated by a similar problem (attributed to Boshernitzan), stated 
below. A sequence x is strictly ergodic if it is of the form 

Xn = /(r"a), 


where / is a continuous complex-valued function defined on a strictly ergodic topo¬ 
logical dynamical system {X, T) and a G X (equivalently, the shift orbit-closure of 
a; is a strictly ergodic dynamical system). 


Question 1.2. Is the Mobius function uncorrelated to any strictly ergodic sequence? 


We have tried to understand what kind of question this is and what it means 
for a sequence to be uncorrelated to any strictly ergodic sequence. As it turns out, 
the condition defines an interesting, nontrivial class of sequences. 

Let us now recall the notion of a generic point. By invariant measures we will 
always mean T-invariant Borel probability measures on X. 

Definition 1.3. A point x in a topological dynamical system {X,T) is generic for 
an invariant measure p if 

1 f 

lim-^/(T*a:) = f dp, 
n n ^^ / 

2=1 

for every continuous (real or complex) function f on X. 

We recall the well-known fact that generic points always exist for ergodic mea¬ 
sures, in which case they form a set of full measure. Moreover, if a topological dy¬ 
namical system is uniquely ergodic then all of its points are generic for the unique 
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invariant measure. It is also well known that in the full shift over a finite alpha¬ 
bet every shift-invariant measure has a generic point. An important simplification 
which arises for symbolic systems is that for genericity of a point it suffices to verify 
the convergence for the countable family of characteristic functions of cylinder sets. 
Moreover, each point x is now a sequence and the ergodic average of length n for 
the characteristic function of a cylinder associated to a block B corresponds to the 
frequency with which B appears in the initial block . Thus x is generic 

for a shift-invariant measure ^ if and only if the following holds, for every positive 
integer m and every block B G A'™: 

(1.1) fj,{B) = lim i#{i e {1, 2,..., n} : X[,^i+rn-i] = B} 

n L I j 

(by writing /i(i3) we identify blocks with their associated cylinder sets in K^). We 
will need a similar notion of semi-generating a measure along a subsequence: 

Definition 1.4. We say that a sequence x G semi-generates a shift-invariant 
measure fi along a subsequence (n^) if the following holds, for every positive integer 
m and every block B G K"^: 

(1.2) p,{B) = lim €{1,2,..., Uk} : X[i^,+rn-i] = B}. 


By the weak-star compactness of the space of probability measures, given a point 
X G K^, every subsequence (rifc) contains a sub-subsequence along which x semi- 
generates an invariant measure. Now, given two points, x and y, we can treat 
the pair {x,y) as an element of {K x K)^ (which is also a symbolic space), hence 
every subsequence (uk) contains a sub-subsequence [uki ) along which the pair {x, y) 
semi-generates an invariant measure f on (AT x AT)^. It is elementary to see, that 
along every such sub-subsequence (ufe,) all the limits involved in the definition of 
correlations, i.e., 



exist and equal the respective integrals 


xiyidf, 



/ 


yi diy, 


where /i and v are the marginal measures of f (i.e., ^ is a joining of y, and u). 

Of course, the most convenient situation occurs when the points x and y are 
generic for some invariant measures. In this case the entire sequences ^ 

n Sr=i yi (but not -b X]r=i ^ilh) converge, which simplifies many arguments. 
Because we are interested in studying correlation with strictly ergodic sequences 
(and every element of a strictly ergodic system is generic), we can immediately 
assume that our sequence y is generic. It is thus reasonable to first look at sequences 
X which are also generic (we will extend our results to general elements x in the 
last but one section). In this manner we are led to the following question: 

Question 1.5. When is a sequence x G K^, generic for a shift-invariant measure, 
weakly (strongly) correlated to a strictly ergodic sequence? 


The first (but already rich in consequences) reduction of the problem relies on 
an observation made long ago by B. Weiss f |WeOO) l. 
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Theorem 1.6. If x G is generic for an ergodic measure then it is a d-limit of 
a sequence of strictly ergodic points, where d is the Besicovitch distance 


1 " 

d{x,y) = limsup - \xi - yi\. 

^ n < ^ 


2=1 


It is easy to see that if ?/ is a sequence strongly or weakly correlated to x, then 
the same holds for y and any x' sufficiently close to x in d. This remark, together 
with the fact that every strictly ergodic point is generic for an ergodic measure 
immediately imply that 

Theorem 1.7. A sequence x G is weakly (strongly) correlated to a strictly 
ergodic point if and only if it is weakly (strongly) correlated to a point generic for 
an ergodic measure. 


This theorem allows us to formulate question [13] in a simplified, yet equivalent, 
version: 


Question 1.8. When is a sequence x G , generic for an invariant measure y, 
strongly (weakly) correlated to a sequence generic for an ergodic measure? 

Notice that if p. is ergodic then every point generic for p is correlated to a point 
generic for an ergodic measure (namely to itself). An exception occurs when p is 
a pointmass concentrated at a fixpoint. Such measure is ergodic, yet any of its 
generic points is uncorrelated to any sequence (even to itself). In either case, our 
question trivializes if p is ergodic. As we shall see, in the nonergodic case, the 
answer depends exclusively on the properties of the invariant measure, not on the 
choice of the generic point x. More precisely, the answer depends on the ergodic 
decomposition of p, leading to a discovery of some new features of nonergodic 
invariant measures. 

We conclude this section with a remark concerning the Mobius function p. We 
can now connect Boshernitzan’s question with another celebrated conjecture, the 
Chowla Conjecture. Leaving aside its precise formulation (see |Cho65j L its validity 
would imply that p were generic for a specific ergodic measure. In view of Theorem 
O this would also imply the negative answer to Question ll.2l the Mobius function 
(as being generic for an ergodic measure not concentrated at a fixpoint) would be 
strongly correlated to some strictly ergodic sequence. 


2. Correlation of measures 

Following the discussion of the preceding section, we introduce the correlation 
of measures. 


Definition 2.1. The correlation between two shift-invariant measures p and v on 
is the number 


corr{p, v) = sup 


xiyi df, 



where {x,y) i-A xi and {x,y) i-A yi are the “first symbol value” functions on the 
Cartesian square of the shift space , and ^ ranges over all joinings of p with 
V. The measures are uncorrelated if corr{p,i') = 0. Otherwise we say that the 
measures are correlated and any joining for which the above difference is nonzero 
will be referred to as a correlating joining. 
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We can now formulate a question concerning invariant measures, completely 
analogous to Question 11.51 posed for sequences generic for invariant measures. We 
will say that a measure is strictly ergodic if its topological support (viewed as a 
subshift) is strictly ergodic. 

Question 2.2. When is a shift-invariant measure /x supported by , uncorrelated 
to any strietly ergodic measure? 

In order to simplify this question we first prove an analog of Theorem 1 1.71 which 
allows to drop the adjective “strictly” from the formulation. 

Theorem 2.3. A shift-invariant measure fi on is correlated to a strictly ergodic 
measure if and only if it is correlated to an ergodic measure. 

Proof. Since every strictly ergodic measure is ergodic, one implication is trivial. 
Suppose p, is correlated to an ergodic measure u. Let ^ be a correlating joining 
of p, and u. Let {x, y) be generic for f (such a pair exists in the full shift over 
K X K). Then x is generic for /i, y is generic for u, and these points are strongly 
correlated. Indeed all the limits of averages in the definition of strong correlation 
of the sequences x and y are equal to the respective integrals in the definition of 
correlation of the measures pL and v with help of the joining f. Now, by Theorem 
[m y can be replaced by a strictly ergodic element y' (which is generic for a 
strictly ergodic measure u'), so that x and y' are strongly correlated. Along some 
subsequence the pair {x,y') semi-generates an invariant measure ff on {K x AT)^, 
and since x and y' are generic for p and u', the marginals of are p and u', i.e., f 
is a joining of p and u'. Now we need to reverse the preceding argument: all the 
integrals in the definition of correlation of p and v' with help of the joining f are 
equal to the respective limits (along a subsequence) in the definition of correlation 
between x and y'. Since these points are generic and strongly correlated, the limits 
indicate correlation regardless of the subsequence. So, f is a correlating joining of 
p and v'. □ 

Question 12.21 takes now a simplified, equivalent form: 

Question 2.4. When is a shift-invariant measure p supported by , uncorrelated 
to any ergodic measure? 

Uncorrelation is a “weak form” of disjointness (in the sense of Furstenberg), 
which is the condition that all expressions f f{x)g{y) df — J f{x) dp ■ f g{y) dv 
equal zero, when evaluated for all bounded measurable functions / and g of one 
variable. If p and v are disjoint then they are obviously uncorrelated. This raises 
two further natural questions: 

Question 2.5. Are there invariant measures disjoint from all ergodic measures? 

Question 2.6. Does the reversed implication hold: does uncorrelation to any 
strictly ergodic measure imply disjointness from all (strictly) ergodic measures? 

Question 12.51 is answered positively by a relatively simple example (provided in 
the last section). 

As far as Question [2T6] is concerned, it is rather hard to expect that uncorrelation 
for just one specific function (the “first symbol value”) should imply disjointness. 
On the other hand, in the case of symbolic systems, this particular function cor¬ 
responds in fact to a generating partition, and in many proofs in ergodic theory 
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it suffices to consider a generating partition. It turns out that the answer to this 
question is negative (which makes uncorrelation to any ergodic measure a new 
property). However, the appropriate example, that we provide in the last section, 
is far from trivial; in order to verify the desired property we needed to establish 
new criteria for ergodicity of certain types of cocycle extensions. Our example has 
inspired us to introduce the notion of conditional disjointness. This idea and its 
applicability to studying uncorrelation to any ergodic measure are presented in the 
Appendix at the end of the paper. 

Since uncorrelation is essentially weaker than disjointness, another question 
arises: 


Question 2.7. Is the property of being uncorrelated to any ergodic measure an 
isomorphism invariant? 

Again, the answer turns out negative. This property is not even invariant under 
topological conjugacy. That is, after transforming the shift space via an injective 
sliding block code (in this manner the shift space is modeled inside another symbolic 
space), a measure uncorrelated to any ergodic measures may lose this property. It 
is so, because the “first symbol value” function Xi may dramatically change in the 
sense of information content. This is illustrated in another example provided in 
the final section. So, the property must be regarded as one of the shift-invariant 
measure in a particular symbolic representation. 

We continue with further criteria for correlation with an ergodic measure. Note 
that the following one is an isomorphism invariant: 

Theorem 2.8. If the ergodic decomposition of p, has an atom, which is not the 
Dirac measure at a fixpoint of the shift transformation, then /i is correlated to an 
ergodic measure. 

Proof. Let n be the ergodic measure which is the atom of the ergodic decomposition 
of p. That is, 

p=pv+ {l-p)p!, 

where p € (0,1] and p! is some invariant measure. Then p, admits the following 
joining with u: 

£. =pyA + (1 -p)(m' X v), 

where is the identity joining of v with itself, supported by the diagonal, and 
p! X V denotes the product measure. It is now elementary to verify the correlation 
between p and v using 


P 

P 


xiVi d^- xidp yi dv = 


J xiyi dnA + {l-p) J xiyid[p' xv)- J 
j \xi\^ dv + {I - p) J xi dp' J ^di/- p J 


Xidi'+{l—p) J xidp'^ j 
xidv — (1—p) J xidp' J 
p(^J diy - 


xi dv 


yidv = 
yidv = 
> 0 , 


with equality holding only when xi is constant ^-almost surely, that is when u is 
concentrated at a fixpoint of the shift transformation. □ 
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The theorem allows to determine, in particular, that an invariant measure being 
a convex combination of finitely many ergodic components, at least one of which is 
not a pointmass, is correlated to some (strictly) ergodic measure. 

The main result of this section shows that Questions 11.51 and 12.41 are in fact 
equivalent. Note that correlation of measures has no weak or strong form. 

Theorem 2.9. Suppose that x S is generic for an invariant measure pL. The 
following conditions are equivalent: 

(1) X is weakly correlated to a point generic for an ergodic measure, 

(2) X is strongly correlated to a point generic for an ergodic measure, 

(3) pt is correlated to an ergodic measure. 

Proof. Suppose that x, which is generic for pt, is weakly correlated to some y generic 
for an ergodic measure u. This means that there exists a subsequence (jik) such 
that the limit limfc corr(a;[i_„j,], ?/[i exists and is different from zero. There is 
a sub-subsequence (rife,) along which the pair (x, y) semi-generates an invariant 
measure f on {K x K)^. Exactly as in the proof of Theorem 12.31 (the argument 
involving y', v' and f ), ^ is a correlating joining of pL and v. 

Now, suppose that a point x is generic for an invariant measure pL which is 
correlated to an ergodic measure v. Let ^ be a correlating joining of p, and u. We 
need to refer to the following fact, whose proof occupies the next section. 

Theorem 2.10. Let be a joining of two shift-invariant measures p and v sup¬ 
ported by , and let x € be a point generic for p. Then there exists a point 
y G such that the pair {x,y) is generic for f, (in particular, y is generic for v). 

It is obvious that by applying the above theorem to our situation we obtain 
a point y generic for the ergodic measure v and such that x and y are strongly 
correlated. This ends the proof of Theorem 12.91 □ 

3. Generic points for joinings 

This section is devoted to proving Theorem 12.101 To avoid ambiguity, we will 
distinguish between free blocks of length to, i.e., the elements of (there are 
precisely free blocks) and blocks of length to, i.e., subblocks of length m of a 

longer block B or of a symbolic element x. There are precisely |i?| — to -I- 1 blocks 
of length TO in B. Each block is an occurrence of a free block. Every free block 
B G determines, for every m < n, what we call the empirical measure ps on 
the finite space LC™ of free blocks of length to, by the formula 

IJ-b{D) = i#{i G [l,n - m + 1] : = D}. 

(for the ease of computations, we divide by n rather than by the more commonly 
used denominator n — m -I- 1, as a result we obtain a sub-probabilistic vector). 

Definition 3.1. For (probabilistic or sub-probabilistic) measures on we define 
the distance 

dW(/i,u)= ^ \p{D)-v{D)\. 

A block B is said to be (m, e)-generic for an invariant (probability) measure p, on 
if d^^\pB,p) < e. 
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The sub-probabilistic normalization of the empirical measures causes that any 
block (m, e)-generic for a probability measure has length at least 

Notice that a point x is generic for a measure /i if and only if, for every positive 
integer m and e > 0, the blocks a;[i,n] are eventually (i.e., for large n) (m, e)-generic 
for /j.. 

We will need two technical lemmas. 


Lemma 3.2. Assume that B G and are both (rn, e)-generic for /i, where 
I < n(l — ^). Then is (m, 3y/e)-generic for fj.. 

Proof. We have 

\ff{iG 1] : = D} - ^j.{D)\ = nd^'^\giB, h) < ne 

and the same for I in place of n. Thus, subtracting sidewise, we obtain 


Y |#{* € [l-m + 2,n-m + l]: = D} - (n- /)/r(D)| < 

(n + l)e, 


The above calculation misses m — 1 blocks of length m appearing on the left end 
of Adding this number to the right hand side and dividing both sides by 

n — I we obtain 


Y k%+i,n](-D) - < 


n + I 


n — I 


e 


TO — 1 
n — I 


Sen 

< - 

n — I 


(we have also used the inequality to < en, which holds since B is (to, e)-generic). 
Applying the inequality n — I > n^/e we finish the proof. □ 


The easy proofs of the following facts are left to the reader. All blocks addressed 
below are longer than to, the empirical measures are defined on A'™, and the 
distance between them is . 


Lemma 3.3. 

(1) Let B G if". Then the empirical measures determined by B and 
are at most 2^^ apart. 

(2) Suppose the empirical measures determined by two blocks B and C of the 

same length are less than e apart, and the same holds for a pair B', C. Then 

the empirical measures determined by the concatenations BC and B'C 

are less than 2e apart, assuming that the joint length of the concatenation 

exceeds —. 

€ 

We pass to the main proof of this section. 

Proof of Theorem \2. 1(A Let us start by taking a pair {x', y'), generic for the joining 
f (such a pair exists in (if x if)^). Clearly, the points x' and y' are generic for 
the projections y and n, respectively. We will think of {x',y') as of a sequence 
consisting of two rows, x' in the first row, y' in the second. 

Choose a decreasing to zero sequence of positive nnmbers Cm (jn > 1). By the 
Rokhlin Lemma applied to the system with the measure fa we have: for each to 
there exists a Rokhlin tower of height to, with remainder of measnre less than em- 
By a standard argument, the bases of the towers (and hence the remainders) can 
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be chosen closed-and-open (we will say clopen). By lifting to the product space we 
also have Rokhlin towers for the measure 

Throughout the following three paragraphs we fix one value of the parameter 
m. For each point in the product system (i.e., sequence in {K x K)^), we mark 
the times of the visits in the base of the Rokhlin tower by an additional symbol 
(star) and the visits in the remainder of the tower by another additional symbol, 
say, a dagger. Because we chose the bases clopen, adding the stars and daggers 
is a topological conjugacy of {K x K)^ with a subsystem of {JC x /C)^, where 
K, = K X {*, f,0}. In particular, generic points are preserved. We can think of 
^ as a measure supported by (/C x /C)^ (and /r as a measure supported by /C”), 
we will call this the marked representation. Every point generic for p, (including 
X and x') has, in this representation, the structure of a concatenation of blocks of 
the length m starting with the star (we will call these blocks m-blocks), separated 
by strings, of various lengths, of symbols marked by the dagger (the sequence may 
start with a prefix of length at most m — 1, without any markers). The daggers 
occur with density less than Cm, the stars have density between and —, every 
free m-block occurs with the density equal to its measure. We let Im be such that 
for every n > Im the block (s', yO[i,n] is (m, em)-generic for ^ (in particular, the 
block is (to, em)-generic for p) and also the block X[i^n] is (to, em)-generic for 
p. We can inductively arrange that Im+i is larger than This proportion and 
Lemma [ 3 T 3 I 111 imply that for n > Im+i the blocks (a;', y')[;^_|_iand are 

(to, 3em)-generic for ^ and p, respectively. 

We will now describe the inductive ruth step of our construction, in which we 
define the second row y in the pair {x, y) between the positions Im + ^ and Im+i 
(we let be defined arbitrarily). 

Given a free ru-block A over the alphabet K, we locate all its occurrences in both 
^[im+i,im+i] ^'[i + 1 / + 1 ] (sitting completely inside). Next we define y at the 
positions corresponding to the occurrences of A in a; proceeding from left to right 
and rewriting consecutive symbols of y' from the positions corresponding to the oc¬ 
currences of A in a;jj maintaining the order. We continue until we exhaust 

the available positions in either a;[;^_|_i or im+i] both). We perform 

this procedure separately, for every free ru-block A. Note that since the ru-blocks 
always occur separately (without overlapping), there will be no collision at any po¬ 
sition of y. As a result, nearly all two-row ru-blocks occurring in (a;', t/')[im-i-i,im+i] 
will be copied in (a^, y)[i„,-i-i,i„ 4 .i]; there is a 1-1 correspondence between nearly all 
ru-blocks here and here. The correspondence is defined except on a small percent¬ 
age of the ru-blocks here and here, mainly occurring near the right end (for each 
free ru-block A the number of occurrences in a;jj may be slightly larger 

or slightly smaller than in - in either case the excessive ru-blocks on 

one side have to be excluded from the 1-1 correspondence). Precisely, since both 
first-row blocks are (to, 3em)-generic for p, this percentage does not exceed 6em- 
In the end, there may remain some unfilled positions in ; we fill them 

arbitrarily. This concludes the definition of y. 

In order to verify that the above defined pair (x, y) is generic for ^ we will 
compare, for every positive integer toq, the empirical measures on (K x K)'^° 
determined by the initial blocks (T,y)[i,n] and (a;', y')[i^„]. Since the latter tend to 
^ with increasing u, it suffices to prove that the distance between the above two 
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empirical measures tends to zero. So, we fix an toq and n larger than Img+i- We let 
m > mo be the largest index such that n > Im+i- Note that m implicitly depends 
on n and tends to infinity as n grows, so that all expressions of order Cm, or ^ 
tend to zero with n. To avoid notorious repetitions of the phrase “blocks of length 
Too over K x if” (as opposed to other blocks referred to in the argument) we will 
call them shortly the words. From now on empirical measures are always defined 
on so understood free words. 

We begin by proving that the empirical measures determined by (x, 
and (a:^ yO[im+i,Zm+i] close. We classify the words contained in (s, 
in four groups: 

(I) words not inside any m-block; these can be recognized by either having a 
star not at position 1, or a dagger anywhere (we mean the stars and daggers 
created in step m), 

(II) words contained inside any of the two possible m-blocks appearing at either 
end of and only partly contained in (x, y)[z^+i,i„_,_i], 

(III) words contained within an m-block sitting inside (x, but being 

the “excessive” m-block (excluded from the aforementioned 1-1 correspon¬ 
dence) , 

(IV) words contained within an m-block sitting inside and in¬ 

cluded in the 1-1 correspondence. 

By the (m, 3em)-genericity and thus also (1, 3em)-genericity of for p, 

the first group constitutes a small percentage of all considered words (smaller than 
RT-o(3em + ;^ + Cm))- Tbe same genericity implies small proportion between m and 
the length of , hence smallness of the second group. We already know 

that among all m-blocks contained in (a:, only a small percentage (at 

most 6em) is excluded from the 1-1 correspondence. This implies that the third 
group is small, as well. We have shown that the last group dominates (constitutes 
a percentage converging to 1 with growing n) of all words in (a;, . The 

1-1 correspondence between the m-blocks induces, in an obvious way, a 1-1 cor¬ 
respondence between words in group (IV) and words in the analogous group (IV) 
regarded for (a:', ?/')[;^+i (which, by a symmetric argument, dominates among 
all words in (a;', t/')[i^+i^;^_^j]). This clearly implies that the empirical measures 
determined by {x 'and (a:, are close, as desired. 

Now consider two cases: 

(a) Ti ^ R-y/Cm, and 

(b) the opposite. 

In case (a), (a:, is obtained from (a:,?/)[i ^] by truncating at most 

?T-(em + ^/^) terms at the ends, and the same holds for {x', y'). Thus, Lemma l531 fll 
(applied twice) and the closeness of the empirical measures of (a:, 
and (a:', 2/0[im-l-i,im+i] buply closeness of the empirical measures of (x, j/)[i_„] and 
(x', 2 /')[i which concludes this case. 

In case (b), we will argue that also the empirical measures determined by the 
blocks (a;, and (x', are close. Observe that the closeness of 

the empirical measures determined by (a;,and (x',t/')[/^_|_i_/^^p was 
deduced using exclusively the fact that the blocks and im+i] 

were (m, 3em)-generic for p. Now we are in a very similar situation: since Im+l < 
n{l — y/e^), we can use Lemma [3.21 to conclude that the blocks X[j^^j_|_i „] and 
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^[i + 1+1 n] both (m + 1, 3^em+i) -generic for Because the algorithm of defin¬ 
ing 2 /[i„+i+i,z^+ 2 ] proceeds from left to right, we can stop it when we reach the coor¬ 
dinate n and we will have the 1-1 correspondence already defined between majority 
of (m-l-l)-blocks in (a:, y)[i^^i+i,n] and (a:', . From here the closeness 

of the empirical measures determined by (a;, and (a;', (with 

^Cm+i replacing Cm in the estimates) follows as in the preceding argument. 

Now Lemma [5751 f2') yields closeness of the empirical measures of (x, y)[i^+i,n] and 
(x', (viewed as appropriate concatenations) and one more application of 

(1) extends this to (x,y)[i and (x', ?/')[i The proof in case (b) is complete. □ 

4. General sequences 

A general sequence over the alphabet K is typically not generic for any invariant 
measure but it is semi-generic for a range of invariant measures. As we will show, 
in such case we can decide about its uncorrelation versus weak correlation to a 
(strictly) ergodic sequence by examining all the measures semi-generated by x. We 
cannot expect strong correlation results in this case. The main theorem of this 
section requires a lemma similar to Theorem 12.101 

Lemma 4.1. Let x € be semi-generic, along a subsequence (ufc), for an invari¬ 
ant measure fi, and let ^ be a joining of fa with a strictly ergodic measure v. Then 
there exists a point y generic for v and such that the pair (x, y) semi-generates 
along a sub-subsequence of(nk)- 

Proof. We will only outline the proof, focusing on the details which are different 
than in the proof of Theorem l2.10l 

We pick a pair {x',y') generic for Since u is strictly ergodic, we can assume 
that y' is strictly ergodic, which implies that it is uniformly generic, i.e., for every 
integer m and e > 0 there exists an n such every block of length at least n, occurring 
in y', is (m, e)-generic for v. 

As in the other proof, we fix a decreasing to zero sequence Cm and, for each m, we 
create the marked representation with the m-blocks. We select the lengths Im from 
the subsequence (rifc) so that the blocks xq^j^j and (x', i/')[i_i^] are (m, em)-generic 
for p, and ^ respectively. As before, we arrange that Im+i is larger than but 
we do not care about genericity of initial blocks of other lengths n. The inductive 
step of defining y is identical as before. The verification that (x,y) semi-generates 
^ along {Im) (which is a sub-subsequence of (nfc)) is simplified; since we do not care 
about other lengths n we do not need to consider the two cases (a) and (b), or 
invoke Lemma 13.21 

We need, however, an additional argument to prove that y is generic for v (it is 
obvious that y semi-generates u along {Im))- But this fact follows easily from the 
assumption that y' is uniformly generic for v and that y is built as a concatenation 
(with insertions of density zero) of longer and longer blocks occurring 'my'. □ 

Theorem 4.2. A sequence x G is weakly correlated to a sequence generic for an 
ergodic measure if and only if x is semi-generic for at least one measure correlated 
to an ergodic measure. 

Rephrasing the theorem, x is uncorrelated to any sequence generic for an ergodic 
(equivalently, strictly ergodic) measure if and only if all of the invariant measures 
semi-generated by x have the property of being uncorrelated to any ergodic measure. 
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Proof. If X is weakly correlated to a point y generic for an ergodic measure v then 
there is a subsequence (rife) such that the limit limfe corr(a;[i^ji^], exists and is 

different from zero. Choosing a sub-subsequence we can assume that the pair (a;, y) 
semi-generates an invariant measure ^ on the product space. Clearly, the second 
marginal of ^ is iz, while the first marginal is an invariant measure y for which x is 
semi-generic and which is correlated with v (via the joining ^). 

Now suppose that x semi-generates an invariant measure y which is correlated 
to an ergodic measure p via a joining f. By Theorem 12.31 we can assume that v is 
strictly ergodic. Now, Lemma 14.11 allows to couple x with a point y generic for iz 
so that (x, y) semi-generates It is clear that x and y are weakly correlated. □ 

5. Examples 

Example 5.1. This is an example of an invariant measure on K^, disjoint from 
(hence uncorrelated to) any ergodic measure. Consider the mapping T(t, s) = 
(t, s+t) on the two-dimensional torus, equipped with the product Lebesgue measure 
dt X ds. Ignoring a set of measure zero, the space decomposes to invariant circles, 
where on each circle we have a different irrational (hence ergodic) rotation. Suppose 
that an ergodic measure p is not disjoint with dt x ds; it follows that the set A of 
such parameters x that v is not disjoint with the irrational rotation by the angle x, 
has positive Lebesgue measure in the base. The fact that v is not disjoint with an 
irrational rotation is equivalent to v having an eigenvalue rationally dependent with 
the rotation angle. Since the ergodic measure v possesses at most countably many 
different eigenvalues, it follows that the set A is at most countable, hence of measure 
zero, implying disjointness. Now, we take any (measurable) finite partition of the 
2 -torus which partitions nontrivially every ergodic circle, and label its members by 
elements of a finite subset K of the unit disc. This produces a symbolic factor y of 
dt X ds on disjoint from all ergodic measures (as a factor of such, nontrivial on 
each ergodic component). 

Example 5.2. There exists a shift-invariant measure on uncorrelated to any 
ergodic measure, yet not disjoint with an irrational rotation (in fact being an ex¬ 
tension of such a rotation). 

Consider the direct product of the identity on the circle Si = [0,1) (0 = 1) with 
an irrational rotation (by a) on (represented as (■ -fa) mod 1). Let A C 5'i x 
be the triangle A = {(f, s) : t G [0,1), 0 < s < f} and let 

ip{t,s) = 

(i.e., if equals —1 on A and 1 otherwise). Consider the Z 2 -extension (Z 2 written 
multiplicatively, as { — 1,1}) corresponding to the cocycle (/?: 

T^{t, s, k) = {t,s + a,K- ip{t, s)), 

where (t, s, it) G X = 5^ x Z 2 . This mapping preserves the measure y which 
is the product of the Lebesgue measure on and the Haar measure on Z 2 : 
dy = dt X ds X dn, and the corresponding measure-preserving system {X,T^,y) 
is clearly an extension of the irrational rotation by a, which appears on the second 
coordinate. Thus, this measure-preserving system is not disjoint from the ergodic 
system represented by the rotation. It is easy to see that the above Z 2 -extension 
has a symbolic representation over two symbols { — 1,1} obtained by replacing each 
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point by its Z 2 -forward itinerary 

(t, s, I y 


where 

n—2 

^ ia), 

i^Q 

where the product of zero terms (occurring for n = 1) equals, by convention, 1. In 
this representation, /i becomes a shift-invariant measure supported by K^, where 
K = { — 1,1} is a finite subset of the unit disc, and hence fits in the framework of 
Section [2j Notice that the “first symbol value” function in this symbolic represen¬ 
tation coincides simply with k in the skew product representation. 

We will argue that so defined /r is uncorrelated to any ergodic measure supported 
by any complex-valued subshift. In what follows, we will switch freely between the 
symbolic and skew product representation of depending on our needs. 

Clearly, /i is not ergodic, its ergodic components are measures fit supported by 
the cocycle extensions of the circle rotation with fixed parameter t, and the “section 
cocycle” (pt (equal —I on the arc [0,t) and 1 otherwise). Of course, not all such 
cocycle extensions are ergodic (meaning that the measure ds x dn need not be 
ergodic), but as we will explain in the next section, the set of parameters t for 
which this happens has measure zero (see the statement (16.51) 1. so in the ergodic 
decomposition of fi such parameters may be ignored. 

Let u be an ergodic measure on a complex-valued subshift. Since fi has zero 
mean (the integral of xi is zero), in order to prove that u is uncorrelated to fi we 
need to show that f yiXi d^ = 0 (the complex conjugate can be skipped because xi 
is real) for any joining ^ of u and fi. Let ^ = f ^tdt he the disintegration of ^ with 
respect to dt, so that is a joining of u with fit- 

We have f yixid^ = I(f d^t) dt hence it suffices to show that the inner 
integral vanishes for almost every t. We will do it by proving the following claim: 

• There exists a set i? C of full product Lebesgue measure, such that when¬ 
ever a sequence satisfies, for every j ^ j' , the condition (tj, t^') S 

then the sequence J yiXi d^t, tends to zero with j. 

At first we argue why is this claim sufficient. Suppose that f yiXi d^t 7 ^ 0 
on a positive measure set of parameters t. Then, for some e > 0 the inequality 
I J yixi d^t\ > e also holds on a positive measure set F of parameters t. There 
exists ti G F such that the ti-section of E has full measure. Then there exists 
t 2 G F belonging also to the aforementioned -section of E and such that the 
t 2 -section of E has full measure. Inductively, once ti,... are selected, we pick 
tj+i G F belonging to all the t^'-sections of E for j' < j (and having a full measure 
section of E itself). Such sequence {tj) satisfies the condition that all distinct pairs 
are in E, while the corresponding sequence of integrals does not tend to zero, so 
the claim does not hold. 

In order to prove the claim, assume temporarily that the required set E exists and 
fix a sequence (tj) as described above (with all distinct pairs in E). From now on, 
we will abbreviate the indexes tj by j (and write fij , ^j , (pj instead of , Cq j ‘ftj , 
respectively). Consider a countable joining ( of the measures n, pi, p 2 , Ps, ■ ■ ■, such 
that for every j > 1 the projection of C jointly on the zero’th and jth coordinate 
equals ^j (there exists such a joining: all joinings ^j have the common factor u. 
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hence we can take ^ to be their relatively independent joining over the common 
factor). 

On the product space supporting C we have the function yi (depending only 
on the zero’th coordinate) and the functions , where depends 

only on the jth coordinate and represents the first symbol value on the support 
of Hj. All these functions are measurable and bounded, so they belong to A^(C)- 
The integrals f yiXi can be written as f yix[^^ d(, i.e., they become the inner 
products {yi,x^i^) in L'^{C)- If we knew that the functions x^-^^ were pairwise or¬ 
thogonal (they are obviously normalized), then the above inner products would be 
the Fourier coefficients of the projection of yi onto the subspace spanned by the 
functions and thus they would form a sequence belonging to in particular 
they would converge to zero, as needed. 

For orthogonality of x^^ and x'f ^ (for j ^ f) we need to check that the integral 
/ x^^^x^^ ^ dC equals zero. This integral equals / x^^^x^^ ^ where Ci.j' is the 

projection of C onto jointly the jth and j'th coordinates. Clearly is a joining 
of jij and fiji, so, in fact, it suffices to show that and are uncorrelated. Since 
both measures are ergodic, every their joining decomposes to ergodic joinings, thus 
it suffices to examine their ergodic joinings (denoted henceforth by 6) only. 

Now we must go back to the original skew product representation and study pos¬ 
sible ergodic joinings 0 of pt and pt' (where t, t' abbreviate tj and ty , respectively). 
Let {x,x') be a pair generic for 9. This pair is obtained in the following manner: 
we choose two points, sq and Sq on the circle, and two initial values, kq and Kg, 
from Z 2 , and then x and x' are given (as symbolic sequences) by the rule 

n—2 

= ^0 ^t(so + Xa), 

i^O 

n—2 n—2 

= ^0 n +*'^) = ^0 n ‘ft'{so + u + ia), 

2—0 2—0 

where m = Sg — sq. By genericity of (a;, a:'), the integral of the product of the first- 
symbol value functions can be evaluated as the limit of the averages of the products 
of the jth-symbol values, i.e., we just need to look at the sequence Xnx'^ (obtained 
by coordinatewise multiplication of the above two sequences). Since kqKq is just 
another element of Z 2 , such sequence is obtained as the symbolic representation of 
the point (sg, kqk'q) in the cocycle extension (of the same rotation by a), with the 
new cocycle (p(s) = (pt[s)(pt'{s + u). This new cocycle equals —1 on the symmetric 
difference of the intervals [0,t) and [u,u-\-t') (and 1 on the rest). Suppose that this 
new cocycle extension is ergodic (with respect to the product measure ds x dn). By 
a classical theorem of Furstenberg f [Fu61] '). this extension is also strictly ergodic, 
and hence every point (in particular the one we have selected, (sq, koKq)) is generic 
for ds X dn. Thus, the limit of the averages we are interested in equals the integral of 
the “first symbol value” function in the symbolic representation of the new cocycle 
extension, i.e., of the function (s, k) i—>■ k. Clearly, the integral of this function with 
respect to ds x dn equals zero. 

In this manner, we have arrived to the following conclusion: All we need is the 
existence of a set A C of full product Lebsgue measure, such that every pair of 
parameters (t,t') € fulfills: 
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• for any u € Si the cocycle extension corresponding to the cocycle equal to 
— 1 on the symmetric difference of the intervals [ 0 ,t) and + t') (and 1 
on the rest) is ergodic. 

The following section is devoted to studying ergodicity of four-jump Z 2 -extensions. 
As a corollary of the criteria which we provide, we will derive that the above set E 
indeed exists (see Theorem 16.41) . This ends the verification of our example. 

Example 5.3. This example shows that the property of being uncorrelated to any 
ergodic measure is not a conjugacy invariant. 

In the preceding example, we have a shift-invariant measure p on {—1,1}^. The 
code 

n(a;)„ = XnXn+l 

has the effect that it turns the cocycle (p into a semicocycle, that is it reproduces 
the symbolic system arising from reading the function p along the orbits (without 
choosing randomly the initial value and without the cumulative multiplication). In 
this factor, the first symbol value function is equal (up to measure) to the function 
p on Si with the Lebesgue measure. Now consider the code 

3 1 

n (^)n — 

This is a conjugacy sending our system to a subshift over four symbols {—1, — 5 , 5 ,1} 
so that the first symbol carries information about both the original first symbol (by 
just looking at the sign) and the first symbol of the factor by 11 (by looking at the 
finer value). We will show that the measure 11'(/r), although it is conjugate to p 
uncorrelated to any ergodic measure, is correlated to an ergodic measure, namely 
to the rotation by a represented symbolically as a Sturmian system given by the 
semicocycle pt (the choice of t is in fact arbitrary, but to get the classical Sturmian 
representation we choose t = a). Indeed, for any joining ^ of n(/iQ,) with 11'(/i) we 
have 

j yixid^=^ j p{a,s')xid^+^ J p{a, s')p{t, s) di, 

where ^ is understood as a joining of the Lebesgue measure on the circle {a} x Si 
(here the variable is s') with p, which in the rightmost integral is replaced by the 
product Lebesgue measure (here the variables are t and s). The central integral 
equals zero, because it attempts to correlate p with an ergodic measure. We can 
choose ^ so that in the last integral it represents a joining concentrated on the 
diagonal set {s' = s} and here it is the Lebesgue measure dt x ds (such measure is 
easily seen to be a joining of n(/iQ,) with n(/r), so it can be lifted to a joining of pa 
with p). Then the integrated function equals —1 for pairs (t, s) such that t < s < a 
and a < s < t, and 1 otherwise. As very easy to see, this integral is positive (a, 
being irrational, differs from 0 or 1 ) and so ^ is a correlating joining. 

6. Ergodicity of four-jump Z2-cocycles 

In this section we will consider Z 2 -extensions T^p of an irrational rotation (by a) 
on the circle Si equipped with the Lebesque measure (denoted by ds), determined 
by a cocycle p : Si ^ 2 , 2 : 

T^{s, k) = {s + a,K- p{s)). 
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For n > 1, the nth iterate of T^p is determined by the n-step cocycle 

n—1 

(^W(s) = ip{s+ja), 

3=0 


namely 

k) = (s + no, K • 

As before, we will say that the cocycle ip is ergodic if the product measure ds x dn is 
ergodic under T^p. One easily proves the following criterion for this Z 2 -extension: tp 
is ergodic if and only if it is not a coboundary, i.e., there is no measurable solution ip 
of the so-called cohomological equation, which, in the multiplicative notation, reads: 

6.1 (pis) = — 77 ^. 

ip{s) 

The function tp, if it exists, can be taken with values in Z 2 . In such case, for each 
n > 1 we have 


( 6 . 2 ) 




ip{s + na) 

■0(5) 


The following lemma gives a sufficient condition for ergodicity of a cocycle. 


Lemma 6.1. If there exist a sequence (uk) of positive integers such that 

• UkOi —>-0 in Si, and 

• -yS 1 in L^{ds), 

then the cocycle ip is ergodic. 

Proof. If ip is not ergodic then (16.11) has a measurable solution ip, and (16.2p holds. 
But then rifcO 0 in 5'i implies (via Luzin Theorem) that ip{s + Ukoi) tends to 
ip{s) in L^{ds) which yields that —>• 1 in L^{ds). □ 


We will be using the following notation: For s € Si = [0,1), we let 

||s|| = min(s, 1 — s) 

(the distance of s to 0 in Sf). This quantity satisfies, for s,s' G Si, the triangle 
inequality ||s-|-s'|| < ||s|| + ||s'||, and ||gs|| < ( 7 ||s||, for q G N (the sums and multiples 
of elements in Si are understood modulo 1). 

The continued fraction expansion of a will be written as [0; oi,..., a^,...] and 
(^)fe>_i will be the sequence of its convergents. Recall that, for all fe > 1, we have 

(6.3) a = — -I-— , where |Cfc| < -<-, lkfea|| < -, 

qk qk qk+i ufc+i qk qk+i 

(6.4) ^ - < \\qk-ia\\ < \\ja\\, Vj : 0 < |j| < qk. 

2qk qk + qk-i 

According to the inequality (16.41) . for s G S'!, the distance between any two 
elements of the set {s — ja : j = 0,..., qk — 1} is larger than 

For t,t',uG Si we will consider the Z 2 -extensions given by the cocycles 

» ipy. = ^ 

• flt'is) =ipt{s)-ipt'{s + u). 
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The function equals —1 on the symmetric difference of the (positively oriented) 
arcs [0,t) and [u,u + t') and 1 otherwise. 

Ergodicity of the two-jump cocycles ipt has been discussed in a series of papers 
(W. Veech |Ve69] . [Ve75] . K. Merrill [Me85] . M. Guenais and F. Parreau |GuPa06| V 
In the latter paper a complete characterization of ergodicity has been given in 
terms of Ostrowski’s expansion of the parameter t relatively to a. These results in 
particular imply that 

(6.5) the set of parameters t for which pt is ergodic has full Lebesgue measure. 

Ergodicity of cocycles with 4 jump points has been proved for some families (cf 
|Me85] . |GuPa06) . |GoPil4) l. Nevertheless, for cocycles of the form it seems 
that no complete characterization of the ergodic case has been given, li t = t', as 
shown in |Me85) (even when a is of bounded type), there exists an uncountable 
family of parameters {t, u) such that the cocycle pfj. is not ergodic. 

We will prove: 

Theorem 6.2. If the cocycle is not ergodic then the parameters t,t',u satisfy 
simultaneously the following four convergences 

limmin{||gfet||, Hg/cwH, \\qk{t' + u)\\} = 0, 

k 

limmin{||(7fct||, \\qk{t - u)||, \\qkit - t' - u)||} = 0, 

k 

limmin{||( 7 fc'u||, \\qkit - m)||, \\qkt'\\} = 0, 

k 

limmin{||( 3 'fe(t' -h u)||, \\qkit - t' - u)||, Hftt'll} = 0, 
k 

where (qk) is the sequence of denominators in the continued fraction expansion of a. 
In particular, we have 

(6.6) limmin{||gfc(<-t')||,||(3'fc(t-l-t')ll} = 0- 

Remark 6.3. The points 0,t,u,t' + u are discontinuities of the cocycle (p^^, (they 
may appear in [0,1) in a different order). Of course, it may happen that some of 
these points coincide (and the cocycle has only two or even no discontinuities). If 
we denote these points as Xi,X 2 ,X 3 ,X 4 , the four convergences in the assertion of 
the theorem can be written as one condition: 

(6.7) for every is {1, 2, 3,4} we have lim min ||( 7 fc(a:i — a:i')|| = 0. 

k 

The proof of the theorem will be provided in a moment. First we derive from it 
the following, important for us, result: 

Theorem 6.4. The set E G of pairs (t,t') such that the cocycles are 
ergodic for all u G Si has full product Lebesgue measure dt x dt'. 

Proof. By Theorem 16.21 it suffices to show that (16.61) fails for Lebesgue-almost all 
pairs (t, t'). For a moment let {qk)k>i be any strictly increasing sequence of positive 
integers. We recall the equirepartition property. Lebesgue-almost every pair (x, y) 
satisfies, for every continuous function f on Sf, the condition 

1 N-l 

f{qkX,qky) = 


J fdudv. 
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Indeed, by approximation of / by trigonometric polynomials and linearity, it suffices 
to prove the above for f{x,y) = ^ with n,m £ Z, and clearly this is 

nontrivial only when either n or to is different from 0. Since (qk) strictly increases, 
the bounded functions x+mqk y) pairwise orthogonal (in the Hilbert space 

L'^{dx X dy)). By Rajchman’s Strong Law of Large Numbers (cf. [Chu68]) this 
implies that 

JV-l 

__ ^2'n:i{nqux+mqkv) _ Q 

N N ^ ’ 

fc =0 

for almost every (x, y), as needed. Since there are countably many pairs ( 71 , 171 ), 
the equirepartition property holds on a full measure set. In particular, for each 
pair (x,y) in this set, the sequence {qkX,qky) mod I is dense in Sf. By change of 
coordinates, there exists a set E of full Lebesgue measure in Sf, such that for every 
pair (t, t') G E the sequence {qk{t — t'), qk{t-\-t')) mod I is dense in S^, in particular 
{t,t') does not satisfy (16.61) . The assertion of the theorem is obtained by choosing 
(qk) to be the sequence of denominators in the continued fraction expansion of 
a. □ 


We now pass to the main proof. 

Proof of Theorem 1 6. ill According to Remark l6.31 we will focus on proving condition 
(16.71) . Condition (16.61) will be derived at the end. 

So, suppose that (I 6 J 1 ) fails. Then there is iq G {I, 2,3,4} and a subsequence 
(ki) (of the indices k) and 7 > 0 such that for any i ^ iq vn {I, 2,3,4} the limit 
below exists and satisfies 

( 6 . 8 ) lim||gfc,(xi - XiJII > 7 . 

Denote ip = ipff,- The discontinuities of the n-step cocycle occur at the 
4n points Xi — ja, where i G {1,2, 3,4}, j G (0,1,..., n — I}. Points obtained 
for the same index i will be called discontinuities of type i. In fact, some of the 
discontinuity points may coincide (in which case the corresponding discontinuities 
disappear), but this case will turn out to be trivial. 

By minimality of the irrational rotation, the return times of the orbit of 0 to its 
neighborhood form a syndetic set. Since (qk) grows to infinity, it is easy to find a 
sequence Uk such that > 0 in S'! and which has the same asymptotics as ^qk, 

i.e., ^^ 7 - 

We are assuming that ip is not ergodic. Lemma EH] yields that tends to I in 
L^{ds). The L^-distance of to the constant function I equals twice the joint 
measure of the intervals on which = — 1 . First suppose that no discontinuity 
of type *0 coincides with another discontinuity. The function assumes the 

value —I on exactly one side of each discontinuity of type ig, until the nearest 
discontinuity point on that side. Thus, the measure of the set where = — 1 
is estimated from below by Uk (the number of discontinuities of type * 0 ) times the 
minimal distance between such a discontinuity and the nearest discontinuity. If a 
discontinuity of type io coincides with another, this minimal distance equals zero 
and the above estimate holds trivially. In either case, we conclude that this minimal 
distance must be of the order o( —), which is the same as o( —). 

) nk ’ ^qk' 

By the remark following (16.41) . for large enough k, the above minimal distance 
must not occur between discontinuities of the same type io, that is the minimal 
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distance occurs between a discontinuity of type io and one of a different type i ^ io 
(depending on k). Clearly, as we proceed along the previously selected subsequence 
(fc;), some type ii ^ iq appears in this role infinitely many times. Replacing (ki) 
by a subsequence, we can thus assume that we always have the same (fixed) type 
ii io- Summarizing, we have fixed a subsequence (ki) of the indices k and a pair 
of types io and ii ^ iq such that there exist two sequences of nonnegative integers 
Ui)i ifi) bounded from above by (rife, — 1 ), satisfying 

lim%, ||(a:io - jia) - {xi, - j»|| = 0. 


Now, since 


hkiiji -fi)a\\ < \ \ ■ \\qkM\ < <V±L 

Qki+i qk, I 

(we have used (16.31) for the central inequality), we can write 


7 < \im\\qki{xig - a;ij|| < 

limqfc,||(xio -j'ia)\\ + limsup ||qfc, (j; - j'i)a\\ <0 + 7 . 

I i 

This contradiction ends the proof of (IgTl) . 

To show () 6 . 6 I) . it suffices to consider subsequences J = {kj) such that the limits 

^j(s) = lim||gfe a;|| 

0 

exist for all the 24 points x obtained by adding or subtracting pairs of different 
points from {xi, X 2 , X 3 , X 4 } (every subsequence of (k) contains a sub-subsequence J 
of this kind), and prove that for any such J either (3j{t — f) = 0 or fij{t -I- 1') = 0. 

The already proved four convergences in the assertion of the theorem imply that 

min{/3j(t),/3j(M),/3j(t' -|- u)} = 0, 
min{,5j(t), I3j{t - u),Pj{t - t' - u)} = 0, 
min{/3j(u), /3j{t - u), /3j(t')} = 0 , 
min{/3j(t' -|- u), Pj{t - t' - u), /?j(t')} = 0- 

We will consider three cases: 

• If /3j{u) = 0 then /3j(t — t')} = 0 (from the second relation) 

and min{/3j(t'),/3j(t — t')} = 0 (from the fourth relation), therefore either 
/3j{t — t') = 0 or both = 0 and = 0. But the latter possibility 

also implies /3j(t — t') = 0 . 

• If /3 j(m) > 0 and /3j(t) = 0 then = 0 (from the third relation), which 

implies /3j(t — t') = 0 . 

• If fij{u) > 0 and Pj{t) > 0 then Pj{t' + u) = 0 (from the first relation) and 
mm{(3j{t — u), I3j{t')} = 0 (from the third relation). In this case, Pj{t') = 0 
is impossible (it would give /3j(t' + u) = Pj{u) > 0), so that we must have 
I3j{t — u) = 0, which combined with /3j(t' -|- m) = 0 yields /3j(t -I- t') = 0. 

Therefore in all cases we have either /?j(t — t') = Q or /?j(t -|- 1 ') = 0, as needed. □ 
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Appendix A. Conditional disjointness 

As we know, if two invariant measures are disjoint then they are also uncorre¬ 
lated. This follows merely from the fact that the condition defining disjointness is 
the same as that for uncorrelation, only it must hold for all, not just one selected 
pair of measurable functions. Our Example 15.21 suggests that there is a weaker 
version of disjointness, which also implies uncorrelation. This condition, defined 
below, is specifically well applicable to cocycle extensions. 

Definition A.l. Let {X,A.,fJ.,T) be a measure-preserving system and let B be a 
sub-sigma-algebra of A. We will say that {X^A,p,T) (or shortly fi) is conditionally 
(given B) disjoint from another system {X',A', fi',T') (or shortly, from p') if 

J f{x)g{x')d( = 0 , 

for every joining f, of p and p' and every pair of bounded measurable functions g 
on X' and f on X satisfying E(/|S) = 0. 

Clearly, such an / has zero integral, hence the right hand side above can be 
written as f f{x) dp f g{x') dp', as in the definition of (unconditional) disjointness. 
Disjointness is the same as conditional disjointness given the trivial sigma-algebra. 

The following is now obvious. 

Fact A. 2. Suppose {X,A,p,T) is a symbolic system over a finite alphabet K con¬ 
tained in the unit disc and that it is conditionally (given some sub-sigma-algebra 
B) disjoint from any ergodic system. If the “first symbol value” function f satisfies 
E(/|S) = 0 then p is uncorrelated to any ergodic measure. 

The above fact is especially useful in considerations of symbolic representations 
of cocycle extensions (of some {Y, B, v, S)) with cocycles taking values in finite sub¬ 
groups K of the unit circle. In such case, the first symbol value function evaluated 
at Ijj, k) equals k and has zero expectation with respect to B. This situation occurs 
in our Example 15.21 where the proof shows that the measure p is conditionally 
(given the irrational rotation factor) disjoint from any ergodic measure. Hence the 
conditions in Eact IA.2I are satisfied. 

This can be generalized as follows: 

Suppose (A, A, p, T) is a symbolic representation of a cocycle extension of an 
ergodic discrete spectrum system (Y, B, v, S) with a cocycle taking values in a finite 
subgroup K of the unit circle. The discrete spectrum system can be represented as 
an ergodic rotation of a compact Abelian group (the addition y -\-u, for y,u €Y is 
understood in this group). 

Proposition A. 3. Suppose the ergodic decomposition of p is p = J ptdt, t € 
[0,1], (note that this parametrization can be applied whenever the decomposition 
is nonatomic), where each pt also represents a cocycle extension of {Y,B,v, S) 
with a cocycle (pt. If for almost every pair t,t' and every u € Y, the cocycle 
(p(^,{y) = (pt{y) pt'{y + u) is ergodic, then p is conditionally (given B) disjoint 
from any ergodic system, hence, by Fact lA.B. uncorrelated to any ergodic measure. 

We skip the proof which is, up to easy generalizations, implicitly included in 
Example 15.21 
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